A Nonparametric Scan Statistic for Multivariate Disease Surveillance

نویسنده

  • Daniel B. Neill
چکیده

OBJECTIVE We present a new method for multivariate outbreak detection, the “nonparametric scan statistic” (NPSS). NPSS enables fast and accurate detection of emerging space-time clusters using multiple disparate data streams, including nontraditional data sources where standard parametric model assumptions are incorrect. BACKGROUND Expectation-based scan statistics [1] extend the traditional spatial and space-time scan statistics [2-3] by inferring expected counts for each location from past data and detecting regions where recent counts are higher than expected. While these methods have been shown to achieve high detection power across a variety of datasets and outbreak types [4], they make strong parametric model assumptions (e.g. Poisson or Gaussian counts), and performance degrades when these models are incorrect. Our solution, NPSS, is a new scan statistic approach which does not assume a parametric model, but instead combines empirical pvalues across multiple locations, days, and data streams to discover significant disease clusters. METHODS Given a time series of observed counts ci,m for each data stream Dm at each location si, we wish to detect spatial regions where the recent counts for some subset of streams are higher than expected. To do so, we first compute empirical p-values Pi,m for each stream and location for each recent day: these are defined as (Tbeat+ 1) / (T+1), where Tbeat is the number of past days (for that data stream and location) with higher residuals and T is the total number of past days. We then scan over the space-time regions (D, S, W), each consisting of some subset of streams D for some spatial area S for the last W days. We search for regions where the Pi,m are significantly lower than expected, corresponding to higher than expected counts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast generalized subset scan for anomalous pattern detection

We propose Fast Generalized Subset Scan (FGSS), a new method for detecting anomalous patterns in general categorical data sets. We frame the pattern detection problem as a search over subsets of data records and attributes, maximizing a nonparametric scan statistic over all such subsets. We prove that the nonparametric scan statistics possess a novel property that allows for efficient optimizat...

متن کامل

A flexibly shaped space-time scan statistic for disease outbreak detection and monitoring

BACKGROUND Early detection of disease outbreaks enables public health officials to implement disease control and prevention measures at the earliest possible time. A time periodic geographical disease surveillance system based on a cylindrical space-time scan statistic has been used extensively for disease surveillance along with the SaTScan software. In the purely spatial setting, many differe...

متن کامل

On the limiting distribution of the spatial scan statistic

Bootstrap is the standard method in the spatial scan test. However, because the spatial scan statistic lacks theoretical properties, its development and connection to mainstream statistics has been limited. Using the methods of empirical processes with a few weak regularity conditions, the limiting distribution of the spatial scan statistic, which can provide a theoretical basis for the spatial...

متن کامل

A nonparametric spatial scan statistic for continuous data

BACKGROUND Spatial scan statistics are widely used for spatial cluster detection, and several parametric models exist. For continuous data, a normal-based scan statistic can be used. However, the performance of the model has not been fully evaluated for non-normal data. METHODS We propose a nonparametric spatial scan statistic based on the Wilcoxon rank-sum test statistic and compared the per...

متن کامل

An Expectation-Based Scan Statistic for Detection of Space-Time Clusters

OBJECTIVE This paper describes a new class of space-time scan statistics designed for rapid detection of emerging disease clusters. We evaluate these methods on the task of prospective disease surveillance, and show that our methods consistently outperform the standard space-time scan statistic approach. BACKGROUND The space-time scan statistic [1] is a powerful statistical tool for prospective...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007